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ABSTRACT 

This paper illustrates an approach: toward understanding natural language 
through the techniques of artificial intelligence. It explores the struc¬ 
ture of English word-endings both morpho-graphemically and semantically. 

It illustrates the use of procedures and semantic representations in re¬ 
lating the broad range of knowledge a language user brings to bear on 
understanding and utterance. 
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Section 1, introduction 

This paper presents an analysis of some aspects of English 
morphemic structure. It is based on an "artificial intelligence" 
approach to language, and differs from many linguistic descriptions 
in its basic premises of how language can be described. Stated 
over-simply, they are; 

1, The Structure of language can best bo described by modelling 
the interpretation rather than the generation of utterances. 

2, Many parts of the model are best characterized as algorithms 
or procedures which explicitly carry out the Interpretation 
process* rather than process-free rules which Implicitly control 
an interpretive or generative process, 

3, It is impossible to describe major areas of language behavior 
with one level of analysis * Such as syntax, isolated 

from other levels such as semantics, 

4, A linguistic theory should describe how a speaker integrates 
alt of his different areas of knowledge in language use. 

This should include syntax, phonology, semantics, and their 
relation to the speaker's knowledge of the context of the 
utterance and the subject of discourse. 

The framework for expressing the linguistic analysis is a computer 
program. for understanding natural language. [Winograd 1971]. It 
is an integrated system which combines syntax* semantics, and deduction 
within a framework of special languages designed for expressing linguistic 
regularities. It has the services of a powerful general-purpose 
deductive language called PLAINER [Hewitt 1969], which can be used 
for reasoning both about the linguistic forms and the subject matter 
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being discussed. 

The analysis of morpho-graphemic structure given in Section 
2 is an expanded version of the one already used in the system, 

Section 3 describes an analysis of some semantic aspects of English 
word formation* and shows how it would relate to the rest of the system. 
It is important to recognize that this is not simply a proposal for 
a particular computer program. Many of the ideas have grown from 
the use of the computer, hut the basic concept of procedural descriptions 
is independent of any actual computer implementation. It is a flexible 
and useful formal representation for many phenomena , and could well 
be used in linguistic descriptions and theories which had no connection 
to the direct use of computers. 
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Section £, The "Spelling" of English Word Endings 

English has a complex set of "spelling roles"' which describe the 
way inflectional endings are attached to root words- These rules 
enable a reader to recognize that* for example, "pissing" is a form 
of "please."* while "beating" is a form of "heat". There Is a structure 
of conventions for doubling consonants* dropping V* changing "y" 
to "i'\ etc, when adding endings. For spoken language these would 
be called the morpho-phonemic rules, while for written language* 
we can call them morpho-graphemic. 

In any complete analysis of language, a word like "running” 
should not be considered a completely separate lexical entity. 

It is a regular inflected form of "run", and both its form and usage 
can be derived from "run” and an analysis of the inflection, first* 
the analysis shoutd include a formalism for describing the way the 
actual Sequence Of Tetters in the inflected form is related by regular 
rules to the form of the root word- There are various ways this 
might be expressed, describing the generation or analysis of the 
resultant forms, One formal Ism which seems particularly well suited 
is an algorithm for interpreting inflected words. The regularities 
are expressed directly within a process which accepts an unfamiliar 
word as an input* and "'breaks it down'" into a familiar root and an 
ending- 

The flow chart in Figure l describes a procedure designed to 
handle a number of regular endings ' p ~n"t" for negative; "-V and 

for possessive; "-s" and its various forms for plural nouns 
and singular third-person verbs; lk -ing"» "-ed" a and "-en"' verb 















































































forms; the superlative H -est"; the comparative "-er"j and the adverbial 
"-Ty"; 

The description uses a few simple notations in addition to norma] 
flowchart conventions. u Hq u and "yes" answers to branching conditions 
are represented by si ngie and double arrows respectively. The function 
CUTOFF indicates what letters are to be removed from the end of the 
word. The ordinals 11 1st 1 , "2nd* 1 , etc. count letters backwards from 
the end of the word, including only those which have not been cut 
off * Several subclasses of letters are used -- VOWEL is {A £ I D 1i V). 
LIQUID is [L R S V Z), and MGEMD is (C G 5 V Z). The label TRY 
represents the part of the procedure which tries looking up the supposed 
root in the dictionary. It uses whatever Tetters have not been cut off. 

At 5mne places, the program tries an interpretation, then if that 
fails, carries out 4 different analysis and fries again. 

As the flowchart shows, these endings share many aspects of 
morpho^graphemic structure, and the procedural representation is able 
to capture these generalities by having them share parts of the 
procedure, it can also detail those aspects peculiar to each ending. 

This is not a complete description of how these endings appear, hut 
covers the great majority of words, and could easily be expanded 
to treat more special cases. The program can correctly analyze such 
relationships as: hashes - bash, bathes - bathe, leaning - lean, 
leaving - leave, dented - dent, danced - dance, dogs - dog. kisses - kiss, 
curved - curve, curled - curl, rotting - rot s railing - roll, played - play, 
plied - ply, realest - real, palest - pale, knives - knife, prettily - 
pretty, nobly - noble, 

ft Is important to note the way in which exceptions are handled 
in the approach. Since the approach is interpretive, this procedure 


can be viewed as one piece of a more genera] procedure which accepts 
an English utterance and tries to understand It, MorphO-graphemlc 
analysis takes place whenever the procedure encounters a word token 
which It cannot find In the lexicon. An irregular form like “was" 
is In the lexicon directly* and the rules will never be applied in 
trying to either analyze or produce it. This distinction between 
lexical Idiosyncrasies and morpho-graphemic generalities is empirical, 

A productive rule like the one relating '"sing* and “sang", "ring 11 and 
"rang"* might well be included in a more complete procedural description t 
while less productive ones (such as relating "will" to "won't") will 
be left in the form of separate lexical entries. 

It is tempting to see this formalism as a simple finite state 
machinej but this is not applicable for several reasons. First, 
the tests which can be done to a word in deciding on its form are 
not, in general* simple checks of the next input letter. Whether 
a certain analysis Is possible may depend, for example, or how many 
syllables there are in the word* Or On some complex phonological 
calculation involving vowel shifts, Semantic and syntactic information 
are also applicable as will be described later. In the case of multiple 
endings [as in “patronizingly") the program must be applied recursively, 
and must take into account the entire derivation up to the point Of 
analysis. The simplified version shown in Figure 1 does not account for 
these extra levels of complexity, but the view of the procedure as 
a flowchart for a part of an integrated computational system makes it 
easy to integrate them. 

The straightforward morpho-graphemic analysis alone is sufficient 
to do much of tho interpretation. In fact some systems [«.§► [Thorne 1969 1 ]) 
use it to avoid having a dictionary of open class words. The Inflection 
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of a word 1ft the Input determines Its possible syntactic classes * 

People rarely operate at this level of Ignorance. They use their 
lexical knowledge to recognize that "under" is not a comparative form 
of some adjective "urid"* and that "bely” Is not an adverbial form of 
"be". This type of knowledge can readily be integrated into the 
interpretive program. Once a possible analysis is determined, 
the hypothetical root can be checked, in the dictionary. If it is not 
there, the program can try another analysis. Notice In Figure 1 
that a word ending In a double LIQUID* followed by an inflection* 
is first tried with the doubled consonant {as 1ft "rolling - roll 11 ), 
then If that Is not found, another try is made without the doubled 
consonant (as in “patrolling - patrol"). 

In one sense, this "multiple guess" method represents an open 
place in the morpho-graphemit theory. No prediction is made about 
which form will succeed, A more- sophisticated test might be included 
which looked at the number of syllables in the word* deciding that a 
single-syllable word, such as "roll" has the consonant doubled, while 
the multi-syllabic "patrol" does not. It could go further and recognize 
that "unroll" is a derivative of a one-syllable word, and therefore 
behaves like "roll". Within the procedural formalism, such knowledge 
can be included by adding to the procedure calls to further phonological* 
syntactic* or semantic subroutines. 

Whatever the level of analysis In such a theory, the remaining details 
are included as idio&yncracles of the individual lexical items Involved. 
People In general operate at a much lower level of analysis than linguists, 
relying on lexical idiosyn trades b provide information which might 
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instead be represented as generalities involving diachronic linguistic 
facts (e.g. knowing the Old English or Latin origin of a word* and 
understanding the effects of this origin on its morphology}. A theory 
should be able to allow this vague borderline if ft Is to describe 
the actual competence of a Speaker * 

One simple relevant bit of knowledge about a word is its 
syntactic class* Ff £ possible root is found in ttie dictionary* 
its class can be checked to see if the ending is appropriate 
(e.g. an "'-est" can apply only to an adjective}. Of course> a word 
can be In more than one category* and all of the possibilities must 
be considered in the analysis. This check of syntactic class can 
be Included directly into the intepretatfon procedures just as the 
other types of knowledge described above* 


Section 3, Some Semantic Aspects of NDminalization 

The description above includes only a few of the possible endings 
In English. In particular, it involves only those which involve no 
major semantic change to the root. There is a whole world of prefixes 
and suffixes which modify words,-changing both their syntactic class 
and their meaning. A "baker" is a person who bakes, a "painting" is 
an object which is the result of painting, or may be an event ("the painti 
of the house went without trouble/ 1 )* An "invention" is a thing which 
is invented, "detainment" is an event of being detained, and so forth. 

The procedure described in Figure 1 could easily be augmented 
to include the morphographemic behavior of endings like "-tlon' 1 
and "-ment“. It already includes those such as 11 -@r" and "-1ng'\ 
tfhat must be added Is some formal description of the semantic changes 
and relationships implied by the additional morpheme* 

One of the central committments of the theory of language described 
here Is the existence of a formalism for representing the "meaning" 
of words and utterances with respect to an internal! "model of the world" 
held by the language user. This internal model contains a set of 
interrelated "concepts" and procedures for manipulating these in carrying 
out deductions* Our description of language uses a logteal-deductive 
language (PLANNER) to represent this model. This paper will not 
describe the formalism or its justification in detail, but will use 
parts of it to illustrate how problems such as ngminail ration might 
be handled. (See [Wtnograd l&71]for much more detail). 

The dictionary definition of a word may include several meanings. 

Each of these involves a basic expression In the logical formalism, 
and a set of semantic markers (see [Katz and Fodor 1964]) as preliminary 
filters for proposed semantic combinations. The exact status of these 
markers will be discussed later. 
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One possible meaning for bake would involve "cooking in an oven", 
and would be done by a person to some sort of food, The resulting 
formal 1 sin might be {in a simplified version, of the notation used in 
[Winograd 1971]): 

(VERB ((*PERS0N) (*FOQ0)} (COOK if] #2 OVEN) } 

The character Is used to prefix a semantic marker, and the symbols 

"#1" and "12" are used to represent the subject and object of the 
verb. Within the total language understanding system, this definition 
is applied after the actual semantic (deep structure) subject and 
object have been determined!. The semantic markers check Its applicability 
to the particular objects involved, and the resulting logical expression 
{with these objects substituted for "#1" and *#2") is used in building 
up the "semantic structure" which represented the meaning of the sentence. 
There is no necessity to express "bake" in this way. The model might 
include "baking" as a primitive concept, rather than: as an instance 
of "cooking", and relate the two within the deductive knowledge of 
the world-modeK There is no fixed line between the knowledge which 
represents the "definition - of a word, and the knowledge which 
relates that definition to the rest of the language-user's world. 

Returning to the problem of word-endings, we could define an 
interpretive procedure associated with the "-er" ending. Which Operates 
on the definition of a verb to produce a definition for the corresponding 
noun. The "-er" 1 program might Say something like "If the verb has 
an interpretation which involves a person as its subject* then the noun 
represents a person who does that action.' 1 This would be described 
within a formal semantic language which has primitives specially suited 


for manipulating these semantic definitions. The exact form of these 
primitives is a part of a complete theory of semantics* Just as a particular 
set of primitive "transformations" is inherent to a transformational 
theory of syntax. 

The result of analysing lp baker" would then be a noun definition 
{again in simplified notation); 

{NOUN ['"'PERSON) [[COOK *"" 7 OVEN}) ) 

Here, the symbol "**** represents the object being described by the 
noun* and the 11 ?" indicates an unspecified object. The definition 
Is of an object with the semantic marker ""PERSON", and which cooks 
some object in an oven. A more detailed description {see [Winograd 1971] 
Chapter 4) would show how this new definition might include additional 
information, such as retaining the semantic marker restriction of "FOOD 
on the object being baked* or putting a specific time into the abstract 
relationship of J 'cook" T The details for doing this already exist In 
the language-understanding system* and! could be Included In the simple 
program which performs the Semantic analysis of' the "-er" ending. The Program 
of course operates on other verbs In the same way* for example converting 
the definition of "run"; 

{VERB (["PERSON)) [RUN f1) ) 

to 11 runner 1 ": 

[NOUN ["PERSON) [[RUN *"*)) ) 

A similar program for "-ing" might convert "shoot": 

{VERB [f"PERSON) {"OBJECT}} [SHOOT £T #2} ) 

into ‘'shooting": 

{NOUN ["EVENT) {(SHOOT ? 7}) ) 


(again, there are other details of how the event would be referred 
to in the logical representation, etc.}. Mot# how this might fit in 
with the use of a phrase beginning with "of". One definition of 
"of" would say (using the semantic prirsiltlwes) "If the "of" phrase 
modifies an event with unspecified objects* the object of the preposition 
should be substituted for one of the objects." Inherent In this 
definition Is the ambiguity of a phrase like "the shooting of the 
hunters"! since there are two unspecified objects in the definition 
of "the shooting"* and the "of" phrase might apply to either. 

The definition of 11 -er" given above is not very sophisticated. 

You may well Say you are not a "baker 1 " even though you have baked things* 
The nominalized form inctudes the additional idea of doing the activity 
as an occupation. This subtlety can be included by slightly modifying 
the "-er" program* and using the ability of the logical formalism 
to create an arbitrary name for a relationship or event, and refer to 
that name in other relationships. If these arbitrary names are of the 
form "REL" followed by a number* the resulting definition of "baker" 
might be; 

(mm (*persq«) ( (cook *** ? ovew (rel&s)){occupation *** rel&&))) 

The name REL65 is assigned to the relationship involving the personas 
cooking something in the oven* end the second relation shows that the 
first Indeed represents the person’s occupation. 

The program now defined for "-er" is more complete, but ft ignores 
the difference between a ''baker" and a "drinker 11 . "Drink" is also a 
verb with a human subject, but a “drinker" is not a person who drinks 
for a, living. In this case, the “-er" ending represents habitual action. 


It is tempting to once again make use of semantic markers, and classify 
the verbs “bake' 1 and "drink" into two different categories. The "-er" 
program could then decide which definition was, applicable by having 
an appropriate semantic marker restriction for each. This is a dangerous 
route. For each new type of nominal Nation* we may need to introduce 
specialized classifications of verbs, whose only ourpoSe Is to 
determine the meaning of that particular nominalization. 

One way to understand the limitations of a system of semantic 
markers is to see them as a type of "pre-calculation". In a Semantic 
system, a definition may include a proviso like "This meaning of the 
verb applies only if the subject is a *PER50H". This could be implemented 
by having a deductive procedure which is called each time the definition 
is used, and which uses facts about the linguistic forms and about 
the world being discussed. The convenience of semantic markers comes 
in noticing that most words can he associated "permanently" with an 

outcome of this deduction. The word "boy" will refer to an object which 

a. 

can be assumed to be^person simply because of the word used, and without 
further deduction- In addition, many of the required computations can 
be reprsented in a subclass-inclusion tree, so their outcomes can bo 
related by simple logical relations. If the ^PERSON deduction is true 
for an object, then the ANIMATE deduction must bo true, and the 
*PHYSICAL-OBJECT deduction, etc. 

Setting up a tree of semantic markers is then pre-calculating a 
related set of predicates concerning the objects referred to by various 
words. This works quite well as a computational convenience for simple 
objects and calculations. However it fails in two ways For more 
complex and realistic ones* First* as unentloned above* the actual test 


needed to decide whether a given definition Is applicable may be 
quite complex* and limited in use, It seems unsatisfactory to assume 
that predicates like "activity which can be done for a living" 
have been pre-computed and stored in a gigantic marker-tree for each 
word In the lexicon. 

More Important, the basic idea of semantic markers depends on 
the possibility of computing the value of the predicate on the basis 
of the words* outside of any particular context. One Of the most 
Important aspects of language use is the fact that every utterance 
occurs in a context of a situation and other utterances. The speaker 
or writer makes heavy use of this in the way he communicates. 

The decision as to what activities can be done for a living may 
dependi on the situation. In a discussion about people who test 
products for food companiesj a "drinker 1 ' might well have a good job* 

If an artist Is composing a picture of a group of people, he might 
well "raise one of the children". In this context, the usual meaning 
of "raising children" is inapplicable, because the objects being referred 
to with the word "children" are in fact drawings* 

This is often explained by assuming that a word has a "primary” 
meaning, and other uses are not explained by the semantic theory, but 
are done by "analogy" or "extension" of the basic rules* By recognizing 
the computational nature of semantic markers, there Is no need to 
make this artificial distinction. The selection restrictions on 
Syntactic combinations depend not on the specific English words, hut 
on the semantic features of the objects and relationships actually 
being described. Pre-cal oil ated semantic markers are a convenient 
device for handling simple cases, but are only a first-order approximation 
to the semantic organization of language* 


The actual program for interpreting an ending Tike "-er“ will include 
not only ways of manipulating the dictionary definitions described 
above, but also calls to a deductive program which can examine the 
world-model. It can use the context and knowledge of an ‘'intelligent" 
reader to answer the semantic questions needed to Interpret a phrase 
or complex word. Rather than asking “Does the noun have the semantic 
marker *CHILD" P it can ask ^Is the object being referred to by the noun 
actually a child?" 


Section 3. Some Theoretical Comparisons 

The analysis presented above differs in a number of ways from 
more traditional linguistic descriptions. It is important to note 
which aspects represent actual departures In theory* Instead of 
Rotational changes, 

first* the idea of an interpretive rather than the usual 
generative grammar does not really represent a theoretical disagreement, 
A "generative grammar" in the sense originally intended is "a 
system of rules that in some explicit and well defined way assigns 
structural descriptions to sentences' 1 [Chomsky 1965, p.S], A program 
for interpretation serves as such an explicit and welI-dofined 
set of rules in the Same way as a set of productions and transformations. 
This interchangeablety is particularly easy to see in the case of 
a finite-state grammar, which can be equally welt defined by a set 
of productions or by the description:'Of a finite-state acceptor. It 
applies as well to more complex languages. 

The advantage of viewing language through interpretation Is 
not one of theoretical power* but of Convenience and simplicity of 
description. It is particularly imporant in interconnecting the 
various aspects of language - syntax, semantics, phonology, etc* In 
a meaningful way. 

In some senses, the programs described above act very much like 
"transformations". The u -er‘' procedure transforms a semantic definition 
into "bake" into one for "baker" following a fixed algorithm. There 
are two basic differences between this and the usual sense of 
"transformation" in linguistics. First, as described in the previous 
section, the algorithm may include computations which involve 
all levels of analysis, including the phonological form of the word, 
or the semantic features of the object or objects it refers to, 


This does mot mean that the rules are any less "explicit' 1 or "■well 
defined", but that they have a much, wider scope. The belief underlying 
this is that no set of rules operating at one level can adequately 
describe language. The interaction between levels Is critical to 
the description of each of them. 

Second* the transformations operate on an explicit semantic 
rather than syntactic representation of meaning. There is no attempt 
to have some syntactic "deep structure" represent all of the meaning. 

For example,, in the case of “baker'^ there must be some explanation 
for the addition of the meaning 'as an occupation". "Baker" cannot 
be derived by a syntactic transformation from “a person bakes"* since 
syntactic transformations cannot add meaning. It therefore must 
come from a deep structure which involves a sentence like "a person 
does 5 as an occupation". Similarly, each semantic fact about the 
interpretation of an. ending must be expressed by making up deep structures 
with hypothetical additional sentences to express the meaning. See 
[Lees- 1960] or [Chomsky 19&9] for a description of many of the difficulties 
Involved in describing English nominalIzatlon* 

The main thesis of this "artificial Intelligence" approach to 
describing language Is that it is both unnecessary and unwise to try 
to separate the syntax of language from its meaning, or its meaning 
from context and non-linguistio knowledge. The Importance of language 
lies in its ability to interrelate these, and its structure is 
organized to achieve that purpose. By using tools Such as procedural 
descriptions, a linguist can deal explicitly with the interconnections, 
and describe language as an organized system for the communication 
of meaning. 
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